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1.  Introduction 


Given  a space  S of  cumulative  distribution  functions  (cdf's)  and 

a weaK  (reflective  and  transitive)  order  relation,  <,  on  S,  a subclass  of 

S is  called  an  ordered  family  of  distributions  (with  respect  to  G £ 5) 

if  it  has  the  form  (F  £ S | F ^ G).  Two  familiar  examples  of  weak 

orderings  are  convex  ordering  and  star  ordering  which  yield  the  increasing 

failure  rate  (IFR)  and  increasing  failure  rate  on  the  average  (IFRA) 

— 

families  when  G(x)  = 1 - e 

Weak  orderings  and  the  ordered  family  model  assumption  have  proved  to 
be  of  considerable  importance  in  the  literature:  (a)  for  providing 
characterizations  of  skewness  and  kurtosis  (Van  Zwet  (1964))  (b)  in  power 

studies  of  certain  rank  tests  (Doksum  (1969)'  as  models  for  the 

lifetimes  of  coherent  systems  (Birnbaum,  Esuo  J Marshall  (1966))  and 
(d)  as  models  for  the  lifetimes  of  systems  subject  to  random  shocks 
(Esuary,  Marshall  and  Proschan  (1973)  and  Barlow  and  Proschan  (1975)). 
^^This  paper  studies  the  problem  of  designing  a single  stage  expert 

A.ij>  k * 

ment  for  the  selection  of  the  distribution(s)  with  the  largest  a-quantile( s ) 
eSip'*-  £ poJ**- 

where  oi  e(o,l)  is  given  and  when  the  model  assumptions  are  specified  by 
an  ordered  family  of  distributions.  More  precisely  suppose  observations 
are  to  be  taken  from  k populations  labeled  II  ,...,11^.  For  each  i in 
Cl,...,k)  let  (a)  F.  denote  the  cdf  of  population  H (b)  (X..:  l<j<ri} 

be  a random  sample  from  n.,  (c)  xa(F^)  be  the  a-quantile  of  F^, 

(d)  Fj-^-j  be  the  cdf  with  the  i^h  smallest  a-quantile,  and  (e) 
denote  the  (unknown)  population  with  cdf  F^-^.  Then 

xa(Fj-.-j)  ...  x (F^-j)  are  the  ordered  a-quantiles.  Assume  samples 
from  different  IL’ s are  independent,  there  is  no  prior  knowledge  of  the 
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pairing  of  IK  and  The  Fi' s come  from  a specified  ordered 

family  of  distributions,  and  the  experimenter  is  interested  in  selection 

of  n(k  t+i), . . . , II(k),  the  t populations  with  largest  a-quantiles. 

We  adopt  the  preferred  population  formulation  in  which  II^  is  called  a 

preferred  population  if  xa(F^)  is  "close"  (in  a sense  made  precise  in 

Section  3)  to  x (Fr,  ^ . , ) and  consider  two  classes  of  selection 
a Lk-t+lJ 

problems: 

1.  selection  of  a subset  of  prespecified  size  s(r_<t,  r<s  < k-t+r) 
so  as  to  include  at  least  r preferred  populations  and 

2.  selection  of  a random  size  subset  containing  at  most  s(l<s<k) 
populations  so  as  to  include  at  least  one  (r=l)  preferred  population. 

The  problem  is  to  design  the  experiment  so  that  (1)  or  (2)  is 
guaranteed  with  at  least  prespecified  probability  P*  no  matter  what 
the  true  vector  of  cdf's.  This  approach  combines  aspects  of  Gupta's 
subset  selection  formulation  (1956,  65)  and  Bechhofer's  indifference  zone 
formulation  (1954)  and  will  be  shown  to  be  a strengthening  of  the  latter. 

Previous  work  on  selection  procedures  for  nonparametric  problems  can 
be  categorized  according  to  (1)  the  parameter  of  interest  for  selction, 

(2)  the  formulation  of  the  design  requirement  and  (3)  the  model  assump- 
tions. One  group  of  papers  considers  problems  of  selection  for  location 
parameters  or  more  complicated  linear  models  usually  under  the  assumption 
that  the  error  distributions  are  continuous  and  symmetric.  They  include: 
Lehmann  (1963),  Bartlett  and  Govindarajulu  (1968),  Randles  (1970),  Bhapkar 
and  Gore  (1971),  Puri  and  Sen  (1972),  Ghosh  (1973)  and  Gupta  and  Huang 
(1974).  Related  papers  include  Patel  (1976)  who  considers  the  problem 
of  selecting  the  distribution  with  the  largest  scale  parameter  when  the 
common  form  of  the  distribution  is  assumed  to  be  TFR  and  Gupta  and  McDonald 
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(1969)  who  consider  selection  in  terms  of  an  arbitrary  parameter  under 
which  the  distributions  form  a stochastically  increasing  family  of  distri- 
butions. A second  group  of  papers  are  those  which  consider  selection  in 
terms  of  a-quantiles.  Rizvi  and  Sobel  (1967),  Rizvi,  Sobel  and  Woodworth 
(1968)  and  Barlow  and  Gupta  (1969)  study  subset  selection  formulations  of 
the  problem  while  Sobel  (1967)  and  Desu  and  Sobel  (1971)  use  an  indifference 
zone  formulation.  For  a complete  review  of  the  literature  see  Lee  and 
Dudewicz  (1974). 


In  contrast,  the  present  work  does  not  assume  the  F^'s  are 
stochastically  ordered  by  a location,  scale  or  other  parameter  as  do  the 
papers  in  the  first  group  nor  does  it  assume  they  are  arbitrary  continuous 
cdf's  as  do  many  of  the  papers  in  the  second  group.  Rather  the  ordered 
family  assumption  falls  between  these  two  extremes. 

Section  2 studies  weak  orderings  and  the  corresponding  ordered 
families.  Sections  3 and  4 formulate  a fixed  size  subset  selection  pro- 
blem and  a random  size  subset  selection  problem  respectively  and  give 
their  solution  for  two  classes  of  distributions.  Finally  Section  5 


describes  some  properties  of  the  two  classes  of  procedures  considered  in 
Sections  3 and  4. 


I 


□ □ 
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Weak  Orderings  and  Ordered  Families  of  Distributions 
Given  a finite  or  infinite  interval,  I,  of  the  real  line,  R,  let  F 
denote  the  set  of  distributions  F which  satisfy: 

(1)  F is  absolutely  continuous  with  respect  to  Lebesgue  measure, 

(2)  S(F)  = {x  e R|F(x+e)  - F(x-e)  > 0 V e > 0},  the  support  of  F, 
is  a convex  set, 

(3)  S(F)  c I. 

In  particular  when  I = R and  [0,°°),  F^  will  be  denoted  as  F and  FQ 
respectively.  For  F e F define  F ^(y)  on  [0,1]  as  F ^(y)  = 
inf{x  e r|f(x)  >_  y}  where  the  infimum  of  the  empty  set  is  taken  to  be  +<*>. 
Remark  2.1.  For  F,G  e F^  the  following  hold 

(i)  S(F)  is  a closed  interval  of  R 

(ii)  1(F)  = {x  e R|0  < F(x)  < 1}  is  an  open  interval  of  R 

(iii)  F is  strictly  increasing  on  the  open  interval  1(F) 

(iv)  F ^( • ) is  continuous  and  strictly  increasing  on  (0,1) 

(v)  <j>(x)  = G ^(F(x))  is  the  unique  strictly  increasing  function  on 

1(F)  such  that  if  X has  cdf  F then  4>(X)  has  cdf  G. 

A weak  ordering, < , on  F^  is  a relation  on  F^  which  is  reflexive 
and  transitive.  Every  weak  ordering  < on  F^  can  be  "extended"  to  a 
partial  order  by  first  defining  an  equivalence  relation  ^ on  F by 
F % G <=*>  F < G and  G < F.  The  resulting  set  of  equivalence  classes 
{E(F)Jf  c Fj)  is  partially  ordered  by  the  relation  < which  is  well  defined 
by  E(F)  < E(G)  <=d>  F < G. 

We  now  define  a class  of  weak  orderings  which  is  a modification  of  a 
class  proposed  by  Panchapakesan  (1969). 

2 

Let  H be  a set  of  functions  h:  I ■+•  I satisfying 

2 

Mx^Xj)  ^maxtx^x^}  V (x^.x^)  c I . 
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Definition  2.1.  For  F,G  c f , F is  said  to  be  H-ordered  wrt  G (written 

F < G)  <=^>  G-1F(h(x  ,x  ))  < h(G_1F(x  ),  G_1F(x  ))  V h e H and 
n J-  ^ J-  ^ 

x , x2  3 X x2  and  h(xx,x2)  e 1(F). 

Lemma  2.1.  H ordering  is  a weak  ordering  on  Fj. 


Proof. 

F ^ G, 
Then  G 


(2.1) 


Reflexivety  is  straightforward.  To  show  transitivity  suppose 
G J and  h e H and  X1»X2  satisfy  x^,x2  an^  h(x^,x2)  e 1(F). 
1F(x1),  G_1F(x2)  and  G_1F(h(x1,x2))  e 1(G)  and  hence 

G_1Fh(x  ,x  ) <_  h(G_1F(xx),  G_1F(x2)) 

<_  max{G-1F(x  ),  G_1F(x2)} 


implies  h(G  ^Ffx^),  G *T(x2))  e 1(G)  by  convexity.  So 


J_1F(h(x  ,x2))  = J~1GG_1F(h(x1,x2)) 


J~1G(h(G“1F(x1),  G-1F(x?)))  by  (2.1) 


< h(J  1F(x  ),  J 1F(x„))  since  G -<  J 

— 1 ^ n 


and  the  proof  is  completed. 


The  motivation  for  introducing  H-ordering  is  that  it  provides  a convenient 
framework  for  embedding  several  well-known  orderings. 

Example  2.1.  Let  I = [0,°°)  and  consist  of  the  single  function 


h(xx,x2)  = [ ~ x2  » xi  5 x2 


» X1  i x2' 


. 
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Now  h(x^,x2)  <_  maxtx^.Xj}  for  any  x^x.^  _>  0 and  so  is  a weak 

ordering  on  FQ.  Furthermore 

G <=^>  G ■''F(t-x)  <_  G 1F(t)  - G ^FCx)  whenever  x,t,t-x  e 1(F) 

1 

<=>  G F is  superadditive  on  1(F). 


In  particular  F G(x)  = 1 - e X <=>  F(t-x)F(x)  >_  F(t)  whenever  x,t 
1 

and  t - x c I(r)  where  F(y)  = 1 - F(y)  <=>  F is  an  absolutely  continuous 
NBU  (New  Better  than  Used)  distribution  with  convex  support. 

Before  giving  further  examples  recall  the  following  definitions.  For 
F,G  e F (i)  f is  convex  ordered  wrt  G (F  < G)  means  G ^Ftx)  is 
convex  on  1(F)  and  (2)  F is  star  ordered  wrt  G (F  G)  means  G ^F(x) 
is  starshaped  on  1(F)  i.e.  G ^F(Xx)  <_  XG  '*'F(x)  whenever  x,Ax  e 1(F) 
and  Ae[0,l].  Convex  ordering  was  introduced  as  an  alternative  to  the 
standarized  third  moment  inequality  definition  of  skewness.  Namely  F G 
is  interpreted  to  mean  "G  is  more  skewed  to  the  right  than  F"  (see  Van  Zwet 
(1964)  pg.  9).  Since  the  star  shaped  property  can  be  thought  of  as  a 
weakening  of  convexity  (see  Bruckner  and  Ostrow  (1962)),  F G can  also 
be  interpreted  as  an  ordering  according  to  skewness.  For  F,G  e F,  F is 
said  to  be  tail  ordered  wrt  G (F  ■<  G)  iff  G ^F(x)  - x is  nondecreasing 
on  1(F).  When  F < G .then  G is  interpreted  as  having  heavier  tails 
than  F (see  Doksum  (1969)). 


Example  2.2.  Take  I = [O,00)  and  H 2 = { h ^ 1 0 <_  A <_  1} 
h^(x^,x2)  = Ax^  + (1-A)x2  <_  maxfx^.x  } vAe[0,l];  hence 
weak  ordering.  It  is  straightforward  to  check  that  F 


where 

H2  ordering  is  a 
G <”r>  G_1F(.  ) is 


:: 


| 

convex  on  1(F).  So  yields  convex  ordering. 

Example  2.3.  Again  let  I = [0,°°)  and  define  = (h^|o  < X 1}  where 
h (x  ,x  ) = Xx  < max{x  ,x„}  V 0 <_  X <_  1 and  so  H defines  a weak 
ordering.  It  is  straightforward  to  verify  that  yields  star  ordering. 

Example  2.4.  Choose  I = R and  = ■{ h ^ 1 0 <^6  < 00 } where 

h(?(x1,x2)  = x,^  - 6 <_max{x1,x2}  V6  t [0,°°);  H 4 ordering  is  a weak  ordering 
which  reduces  to  tail  ordering. 

The  remainder  of  this  section  is  devoted  to  deriving  some  properties 
of  sta-  tail  ordering  which  will  be  used  in  the  later  sections. 

Re-  It  is  straightforward  to  show  that  the  set  of  distributions 

equivalent  to  F e Fq  under  star  ordering  (F  e F under  tail  ordering)  is 
ES,.(F)  = (F( 6x ) 1 0 < 6 < “}  (E  (F)  = {F(x+g)|6  e R}),  the  set  of  all  scale 
changes  (location  shifts)  of  F. 

Lemma  2,2. 

(a)  For  F,G  r.  F , F G <=>  G ^F(x)/x  is  nondecreasing  on 

R+(F)  = {x  > 0|F(x)  < 1). 

(b)  For  F,G  e F,  F < G <^e>  G ‘''F(x)  - x is  nondecreasing  on 
R(F)  = (x  e R|F(x)  < 1}. 

Proof.  Write  R+(F)  as  the  disjoint  union  {x  > 0|f(x)  = 0}  u 1(F)  and 
yx  e {x  > 0|F(x)  = 0},  G ^F(x)/x  = G 1(0)/x  = -°°  and  the  result  follows 
since  G 1'F(x)/x  is  nondecreasing  on  1(F).  The  reverse  implication  is 
obvious  and  the  proof  for  ■<  is  similar. 


An  important  class  of  distributions  preserving  weak  orderings  are  the 


. , ■_  / 


distributions  of  the  order  statistics.  If  X^  ^ (l^q^n)  is  the  qth  order 

statistic  based  on  X, , . . . ,X  iid  F then  P„[X,  n < x]  = B(F(x);q,n) 

1 n t (.  q ) — 


where 


rP 

\ n!  q-1,.  An-q, 

,;<l*n)  = U-q)T(q-TTT  Jn  X (1"X)  dX* 


Theorem  2.1.  If  F,  G e F^.  and  F G then  B(F;q,n)  "fy  B(G;q,n) 
holds  V 1 < q < n.  The  proof  is  immediate  from  the  definition  of  H 
ordering  since  G dF(x)  = G dB  dBF(x)  where  B(  • ) and  B ^(  • ) denote 

B(x;q,n)  and  its  inverse  respectively. 

The  final  part  of  this  section  will  describe  stochastic  bounds  for 
several  classes  of  distributions.  For  G in  F^  with  1(G)  = (0,°°)  and 
0 < 5 < °°  define 

(2.2)  Fft(G)  = {F  e Fq  | F ■<*  G} 

(2.3)  F.,.(G,C)  = {F  e F.v(G)|xa(F)  = O - 

Remark  2.3.  F.,{(G)  = F.,.(GX)  V A > 0 where  ^(x)  = G(Ax)  since  VA  > 0, 

Gx  e Ea(G)  and  F e F*(G)  <=>  F -k  G <=^  F Gx  <=>  F e F.,.^). 

Example  2.5.  Let  G(x)  = 1 - e X or  0 as  x > 0 or  x <_  0 respectively. 

Then  G_1(y)  = -iln(l-y)  for  0 <_  y < 1 and  F e F^(G)  <=>  F is  continuous 

on  [0,»)  and  - Jin  ( 1-F(  x)  )/x  is  nondecreasing  on  R+(F)  <=>  F is  a 
continuous  IFRA  distribution.  In  particular  F s.s ( G ) = F.,.(Gx)  VA  > 0 where 
Gx(x)  = 1 - e *X,  x > 0. 

Associate  with  F},.(G)  the  families  of  distribution 


(2.4) 


(x| 

5)  = 

fG(xG-1(a)/5), 

x<C 

l1 

xH 

( X I 

|C)  = 

r° 

x<C 

\^G( xG  1(a)/C), 

(2.5) 
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where  0 < £ < °°. 


Remark  2.4.  G 1F...  _(*|£)  and  G „(‘|£)  are  starshaped  on 

“,S  ‘'l*' 

R+(F...  (•£))  and  R+(F5,.  ,.(‘|£))  respectively  and  x ( F.,.  (’|£))  = 

x (F.,.  (*|£))  = £.  However  neither  F.,.  (*|c)  nor  F.,.  (-|£)  is  in  F 

Ot  '*  5 X/  **  9 5 X U 

and  consequently  neither  is  in  F.,.( G). 


Theorem  2.2.  For  all  0 < £ < °°  and  F e F.,.(G,£), 


F*,s(-I£)  r %t  1C). 


Proot.  If  R+(F)  = ( 0 , M ) then  x < 0 =>  F(x)  = G(x)  = F.,.  s(x|£) 
F...  ( x | £ ) = 0 and  V0<x<£<_y<M 


G 1F(x)/x  £ G LF(£)/£  = G 1(a)/£  <_  G 1F(y)/y 
F(x)  < G(xG_1(a)/£)  < G(yG_1(a)/£)  < F(y). 


If  M = +°°  the  proof  is  complete  while  if  M < +°°  then  ¥ y > H 
F(y)  = 1 > G(yG  ^(a)/£)  and  the  result  follows.  See  Figure  2.1. 

Since  both  (F.  ( • | £ ) 1 0 < £ < »}  and  {F.  ( * | £ ) 1 0 < £ < °°}  are 

'‘»S  **  9 *• 

(stochastically  increasing)  scale  parameter  families  the  result  can  be 
slightly  strengthened  as  follows. 

Corollary  2.1.  For  0 < £’  <_(<£" 

(a)  F e F.,(G,£')  ">  F < F ( • | £ ) and 

“ Si  “ 5 K 

(b)  F e F.,(G,£")  ->  F.  (-|£)  < . F 

•••,s  1 st 

Remark  2.5.  It  can  be  shown  that  the  bounds  F.,.  ( • | C ) and  F.,.  „(-|£) 

, s ^ 


3.  Fixed  Size  Subset  Selection  Procedures 


3.1  Preferred  Population  Formuation 


When  faced  with  the  problem  of  selecting  at  least  r of  the  t best 


of  k populations  (l<r<t<k)  an  experimenter  may  well  be  indifferent 


between  the  selection  of  one  of  the  t best  populations  and  one  of  the 


(k-t)  worst  populations  when  they  are  sufficiently  close  together.  This 


is  the  rationale  of  the  indifference  zone  formulation  (IZF)  of  selection 


problems  in  which  the  procedure  is  only  required  to  guarantee  correct 


selection  (CS)  with  prespecified  probability,  P",  when  the  t. 


(t+l)£l  best  populations  are  sufficiently  "far"  apart  (see  Mahamunulu 


(1967) ) 


Alternatively,  if  the 


worst  popula 


tions  are  sufficiently  "close"  so  that  an  experimenter  would  be  satisfied 


with  the  selection  of  either  one  then  one  can  define  a preferred  population 


as  any  population  satisfying  this  criterion  and  a correct  selection  as  the 


selection  of  at  least  r preferred  populations.  The  preferred  population 


formulation  (PPF)  requires  the  procedure  make  a correct  selection  with  at 


least  probability  P“  no  matter  what  the  true  vector  F of  distributions 


where 


is  a known  interval  of  the  real  line 


Suppose 


defined  in  terms  of  a function 


is  continuous 


Definition  3.1.  it.  is  a preferred  population  (relative  to  p(-))  iff 
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Clearly  at  least  "(k-t+l ) ’ ' ' ' ,1T(k ) are  Pre^erre<^  populations  and  the  true 
population  configuration  will  contain  between  t and  k preferred  7r^'s. 

Remark  3.1.  The  PPF  yields  a strengthening  of  the  probability  guarantee  over 

the  IZF.  Consider  the  problem  of  selecting  the  t(r=t)  best  ir.'s  when 
k 

the  true  F e fi  = FT  where  ft'  = {F  e fi  x (Fr,  _)  > p(x  (Fri  ..-,))}  and 
^ I 'v,  a [k-tj  a Lk-t+lJ 

fi'  = fi\fi'  are  the  indifference  zone  and  preference  zone  respectively. 

The  corresponding  problem  based  on  the  PPF  is  to  select  at  least 
t preferred  populations  (relative  to  p(*)).  For  F e fi'  the  t 
best  populations  are  the  only  preferred  tt^'s  and  both  the  IZF  and  PPF 
guarantee  their  selection  with  probability  P*.  For  F e fi'  the  PPF 

still  guarantees  the  selection  of  at  least  t preferred  tt^'s  while  the 

IZF  guarantees  nothing. 

Two  specific  choices  for  p( • ) are  now  derived.  Recall  that  for 
F e Fa(G)  any  scale  change  of  F(x),  say  F(6x),  is  equivalent  to  F(x). 
Hence  we  require  that  be  preferred  relative  to  t+1)  independent 

of  the  selected  measurement  units.  More  precisely  this  means  tk  is  close 

"to  ^(k-t+l)  w^en  e Fft(G, £'/<$)  and  F[k_t+l]  e ^*(^,5/5)  independent 

of  6 > 0,  i.e.  p(*)  must  satisfy  V , £ e (0,°°) 


(3.5) 


£’  > p(£)  <=>  £'/6  > p(£/6)  V <S  > 0. 


I 


It  can  be  shown  that  the  unique  p( • ) satisfying  (3.1)  - (3.5)  is 

p(£)  = c"£  where  0 < c'!  < 1.  In  the  tail  ordered  case  a similar  argument 

based  on  the  location  invariant  classes  Et(G)  leads  to  the  choice 

p(£)  = £ - d”  where  0 < d”  < <*>.  Hence  tt^  will  be  called  preferred  in 

the  star  (tail)  ordered  case  iff  x (F.)  > c*x  (Fr,  , ,)  (x  (F  ) - 

a 1 n [k-t+1]  v a [k-t+1]  ' 


In  the  remainder  of  the  paper  G is  a fixed  distribution  in  Fn(F ) 


with  1(G)  = (0,°°)  (R)  in  discussions  of  the  star  (tail)  ordered  model 


Given 


select  a subset  of  size  s containing  at  least  r preferred  populations 


must  automatically  occur.  Let  N 


Take  independent  random  samples  of 


ize  n from  each  population 


where  T 


i i — [k-s+1] 

the  ordered  sample  a -quantiles  T 


min{ t,s} 

;ment : Given  Pste  ( £ ( 

j=r 

determine  the  smallest  n > N 


where  the  event 


3.2  Selection  from  Star  Ordered  Families 


For  fixed  a e{0,l)  it  follows  without  loss  of  generality  from  Remark 


2.3  that  G can  be  choosen  so  that  G (a  ) = 1.  Define  the  following 
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r^l  e n*lx0(F[k-l])  - c*xa(F[k-t+l])  < Xa(F[k-i+l])}  ’ t-i<k 


(3.9)  fl*(i 


*(i)  = { 


{F  e fi*lc“xa(F[k-t+l])  < Xa(F[l])} 


, i=k. 


(3.10)  ^ - (F  e ^|xa(F[k_t+l])  = U. 


(3.ii)  n.(i,o  = n.(i)  n n.,'  F. 

k 

It  follows  that  fi,..  = U r = U U n.,.(i,C)  and,  for  example,  n},{(i,C) 

C>0  £>0  i=t 

is  the  set  of  configurations  for  which  there  are  exactly  i preferred 

populations  and  the  t^h  smallest  a-quantile  is  £. 

It  is  easy  to  see  that  for  F e fijl{(i)  where  r + k-  s<_i<_k 

that  P [CS|r  (s,n)]  = 1.  Hence  inf  P [CS|R  (s,n)]  = min  inf  P [CS|R  (s,n)] 
I 1 % i 1 iel  G*(i)  l 1 

where  I = { i 1 1 <_  i < r + k - s}.  For  F e fi.,.  and  iel  define 


= sample  a-quantile  from 
W.  = rJi.  largest  of  {T(k_i+1)> • • • »T(k)*  and 

A.  = {W.  > at  least  (r+k-s-i)  of  T, ,1,. 
l i — ( 1)  (k-i) 


Remark  3.2.  It  can  be  shown  that  the  A^'s  form  a nondecreasing  sequence 

of  events  and  that  X.  (T),  the  characteristic  function  of  A.,  is  non- 

A^  'v  i* 

increasing  in  any  of  q ) » • • * »T(k_i ) and  ds  nondecreasing  in  any  of 

T(k-i+l ) ’ ’ ' ' *T(k)  ’ T*lFs  lat‘ter  property  and  an  application  of  Theorem  2.1 

implies  that  P„[A.]  = E [x.  (T)]  is  nonincreasing  in  any  of  Fr  ,,..., Fr  ., 
A,  1 Ai  i1-1  Lk-iJ 

and  is  nondecreasing  in  any  of  F^  » • • • »F[k]  under  stochastic  ordering 


t ft*** 
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< (see  Mahumunulu  (1967)  or  Alam  and  Rizi  (1966)). 
st 


Lemma  3.1.  inf  P [CS|R  (s,n)]  = inf  P [A.]  for  i e I. 
Mi)  ^ 1 f2...(i,l)  ^ 1 


Proof.  Pick 


F e Mi, O then  P _[CS  |r  (s ,n) ] = P_[A.]  = P_[W./5  > 
'u  « F I rlFi  — 

^ 


at  least  (r+k-s-i)  of  T,  ,T,,  .,/£].  Now  T'  = T/£  has  the 

(1)  l k-i ) a, 


vector  of  cdfs  F'(x)  = ( F ( £x) , . . . ,F.  ( £x) ) which  is  in  ft.,(i,l)  by  the 
'v  1 k ** 


equivalence  of  F^  under  scale  changes  and  the  definition  of  preferred 


Tt..  So  Pp[A.]  = Pp,[A.]  and  the  proof  is  complete. 
o>  1 


Remark  3.3.  An  equivalent  way  of  viewing  the  proof  of  Lemma  3.1  is  as  an 
application  of  invariance.  The  problem  is  invariant  under  the  group  of  all 
possible  common  scale  changes  of  the  raw  data.  Hence  the  risk  under  a 
0-1  loss  structure  becomes  the  probability  of  incorrect  selection  and  is 
constant  over  orbits  - the  orbit  of  F consisting  of  all  common  scale 
changes  of  the  components  of  F. 

Next  a lower  bound  for  P_[CS| Rn ( s ,n) ] over  F e , is  obtained. 

I 1 % -,1 


Given  i e I and  F e ft5,.(i,l)  it  follows  from  Corollary  2.1  that 

'Xj  * 


(3.12) 


F[j]  V fm<  ■'<:"> 


{ F*,s(,|l)  "st  F[j] 

F*.s(‘lc*>  "st  F[i]> 


, 1 I j I k - i 

,k-t+l<_j<_k 
, for  any  other  j 


since 

x (Fr  .-.)  < c 
a []J  - 

and  c‘ 

< X (Fr-n) 

a L:  J 

having 

k - i,  t , 

F...  .(  • | 

lc*),  F...  (• 

" 9 ^ 

*,s 

[j]' 
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PFLCS| R1(s,n)3  = PpCAi3 


>_P^,(i)[A^]  from  (3.12)  and  Remark  3.2 


— PF(i)*-At^  since  At  c A£ 


- PF(t)l-At] 

*\j 


where  the  final  inequality  holds  since  when  i > t the  it^(k-i+l^~i<k-t ) 

ordered  component  of  F(i)  is  F.  ( • |c*)  ■<  F.  .(•|c,‘),  the  jlil 

t\j  “,S  si  « , JC 

ordered  component  of  £(t). 


To  calculate  the  lower  bound  PF/.  nLA  ] let 

A/  *■ ' ^ 


(3.13) 


H(y ) = B(B(y);  k-t-s+r,  k-t) 
J(y)  = B(B(y) ; t-r+1,  t) 


where  B(y)  = B(y;q,n),  q = [(n+l)a]  and  [•]  denotes  the  greatest 
integer  function.  Hence  H(F.,.  „(y|c“))  is  the  cdf  of  the 
(k-t-s+r)l*l  smallest  of  ^T(  q ) ’ • • * ^ and  cdf 

of  the  (t-r+l)lil  smallest  of  (T^  } all  under  F(t). 


Therefore 


pnticV  1 j H(r*>,<ylc*»"(r*fB<yl1» 

r\.  U 


H(G(y/c'{)  )dJ(G(y) ) 


+ H(G(  1/ c" ) ) ( 1 - J(ct)) 

f“  -1 

= H(G(G  (y)/c*))dJ(y) 

G(c*) 

+ H(G(l/c*))(l-J(a)) 

= F*(r,t,s,k,a,c*)  or  simply  F*. 


19 


Since  F(t)  ^ U..j  (the  component  distributions  are  not  in  F;..(G))  it  is  not 
% 

immediate  that  F"  is  the  infimum  of  the  PCS  over  . The  next  result 
n 

establishes  that  this  is  the  case. 


Theorem  3.1.  inf  P [CS|R  (s,n)]  = F" (r ,t , s ,k, a,c“ ) 

n*  i 

Proof.  From  Lemma  3.1  and  the  discussion  above  it  suffices  to  exhibit  a 

sequence  of  configurations  {,£ . } in  fy.  so  that  lim  P^  [CSjR^Sjn)]  = F“. 

j-HJO  -vj 

A sequence  in  Qj.  which  approximates  F(t)  is  the  obvious  candidate.  Let 
L = [1/c"]  + 1 and  for  j > L define 


0 ,y  < c - 3 

F^  j (y)  ^ G[j(y  - (c*-j  *))]  ,c  - j 1 _<  y < c!S 


G(y/c:‘ ) 


,c  < y 


F .(y) 
s, } 


G(l+j (y-1 ) ) 


,y  < i 


,1  < y. 


Then  F.  . ( • ) -*  F...  ( • |cM)  =>  H(F0  .(•))-►  H(F  ( • |c  ))  and 

F . ( • ) ^ F.  ( • |l ) =>  J(F  . ( • ) ) 5 J(FA  ('ll)  where  ” denotes  weak 

s,  j -,S  ' s,]  *,s  1 

convergence  of  distributions.  Furthermore  F . e F (G,c")  and 

X.  , j L 

F . e F.,.(G,1)  and  so  F.  with  t components  F.  . and  (k-t) 

s,j  " <\,D  F 

components  F . is  in  ftft(t).  Therefore 
s » J 
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PF  [A*] 

'Vj 


H(F.  . (y))dJ(F  .(y)) 

0 *»J  S»J 

c’v  r 

0 H(Ft,j(y))dJ(F*,8(yl1))  + jc 


since  F^(y)  = F*5  ^ |c" ) 


H(F*  (y  |c*))dJ(F  (y)) 

9 X/  ^ 9 J 

on  [c5S,°°)  and 


F <(y)  = F*  s(y|D  co.i)  . 


H(F  . (y))dJ(F#.  (y|l))  + 

J J 9 ^ 


H(F*  „(y|c“'))dJ(Fo  ,(y)) 


s,l 


since  F...  (y|c")  = 0 on  [0,c‘‘) 


- 0 + H(Fi;  <y|c*))dJ(F.  (yll))  = F* 
Jo  'li  " > s n 


where  the  first  convergence  follows  from  dominated  convergence  since 

H(F  .(y))  -*•  0 a.e.  [J(F...  (*|l))]  on  [0,c'*:]  and  the  second  follows 

">s 

from  the  weak  convergence  of  J(F  .(•))  to  J(F.,.  ('ll))  and  the  fact 

s,:  > s 

that  H(F.,.  ( * I c ”* ) ) is  bounded  and  continuous  a.e.  [J(F.,.  (*|1)]  on 

” » ° “ » s 

The  proof  is  completed. 

It  can  be  shown  that  F“  -►  I as  n -*■  °°  and  hence  (3.8)  can  be 

n 

guaranteed  by  a finite  n for  any  p”  < 1. 

Example  3,1.  Given  a e (0,1)  choose  G(x)  = 

SI..{  is  then  all  configurations  of  continuous  IFRA  distributions.  Note 
that  G(’)  is  choosen  so  that  G ^(a)  = 1.  Suppose  it  is  desired  to 
select  at  least  r = t preferred  populations.  In  this  case 


{• 


1 - 
0 


xiln(l-a) 


x ^ 0; 
x < 0 
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F = t 
n 


l-(l-o) 


1/c" 

B(B( 1 - [1-xJ  ' ■ );  k-s,  k-t)dB(x) 


+ B(B(l-[l-a]1/c  );  k-s,  k-t)(l  - B(a))t 


For  computational  purposes  it  is  desirable  to  remove  the  composition  of 
incomplete  beta  functions  from  inside  the  integral.  After  an  integration 
by  parts  and  a change  of  variables  F^  becomes 


(3.14)  (k_t) 

k-s 


l-(l-a) 


1/c” 


(1  - B( 1 - [l-y]C  ))t(l  - B(y))S"tdBk_S(y) 


+ (1  - B( 1 - ( l-a)C  ))tB(B(a);  k-s,  k-t) 


Expression  (3.14)  was  evaluated  on  Cornell  University's  IBM  370/169 
computer  using  a procedure  based  on  Simpson's  rule  (see  Shampine  and 
Allen  (1973)).  Tables  1,2  and  3 give  the  smallest  odd  sample  sizes 
required  to  meet  (3.8)  for  a = ,25(.25).75,  c*  = .65,  .70,  P"  = 

.75,  .90,  .95,  k = 2(1)9,  t = l(l)[k/2j,  r = t,  s = t(l)[k/2]. 

One  measure  of  the  cost  of  using  the  nonparametric  procedure 

R^(s,n)  is  obtained  by  comparing  the  sample  size  required  by  R^(s,n) 

to  achieve  a specified  P"  to  that  required  by  the  best  procedure  for 

the  parametric  problem  with  exponential  populations.  Suppose  it.  bar, 

-X/  9^ 

cdf  F(x|6i)  = 1 - e , x > 0 where  0^  > 0 but  unknown  then 

Xa(Fi)  = -6i  *n(1-a)  ^ xa(F(  ‘ l0i>)  > c\(F[k-t+])  <=>  9i  > C"9[k-t  + l] 
where  0r.n  is  the  i ^ ordered  9..  In  this  case  the  optimal  selection 

procedure  is  the  natural  procedure  based  on  sample  means  rather  sample 


c 

* 

= .65 

P* 

.75 

.90 

.95 

15 

33 

51 

21 

45 

63 

29 

53 

73 

11 

27 

39 

33 

59 

81 

33 

57 

79 

13 

31 

43 

41 

69 

91 

37 

63 

85 

17 

35 

51 

11 

23 

35 

45 

75 

99 

23 

43 

59 

49 

79 

103 

37 

67 

89 

19 

39 

55 

11 

27 

39 

49 

81 

105 

27 

49 

65 

55 

87 

111 

41 

69 

93 

21 

41 

59 

13 

29 

43 

7 

23 

31 

53 

85 

109 

31 

53 

71  • 

19 

39 

51 

61 

93 

117 

35 

57 

75 

63 

95 

119 

4 5 

73 

95 

23 

45 

63 

15 

31 

47 

11 

23 

35 

57 

89 

113 

33 

57 

7 5 

23 

43 

57 

65 

97 

121 

39 

63 

79 

67 

10] 

125 

zes  required  by 

R1(s,n) 

-X 

2 

1 

1 

3 

1 

1 

4 

1 

1 

4 

1 

2 

4 

2 

2 

5 

1 

1 

5 

1 

2 

5 

2 

2 

6 

1 

1 

' 6 

1 

2 

6 

1 

3 

6 

2 

2 

6 

2 

3 

6 

3 

3 

7 

1 

1 

7 

1 

2 

7 

1 

3 

7 

2 

2 

7 

2 

3 

7 

3 

3 

8 

1 

1 

8 

1 

2 

8 

1 

3 

8 

1 

4 

8 

2 

2 

8 

2 

3 

8 

2 

4 

8 

3 

3 

8 

3 

4 

8 

4 

4 

9 

J 

1 

9 

1 

2 

9 

1 

3 

9 

1 

4 

9 

2 

2 

9 

2 

3 

9 

2 

4 

9 

3 

3 

9 

3 

4 

9 

4 

4 

•_3. 

Sample  si 

when 

G(x 

oi-quantiles  (Eaton  (1907)).  Denote  this  procedure  as  R (s,n).  It  is 


when  r = t this  integral  reduces,  after  a change  of  variables,  to 


Expression  (3.15)  was  evaluated  using  an  algorithm  based  on  Simpson's 


rule.  Table  4 contains  the  smallest  odd  sample  signs  required  to  satisfy 


t = l(l)[k/2]  and 


t(l)[k/2] 


These  calculations  how  that  the  exponential  procedure  requires  roughly  half 


as  many  observations  as  the  non  parametric  IFRA  procedure 


3.3  Selection  from  Tail  Ordered  Families 


Choose  0 r 


It  follows  that  ft 


invariance  arguments  similar  to  the  scale  invariance  arguments  of  Lemma  3.1 


it  can  be  shown  that 


Lemma  3.2 


min  1 


where  A.  and  1 are  defined  as  in  subsection  3.2 


From  the  stochastic  bounds  developed  in  Corollary  2.2  for  F e ^(G) 
and  arguments  similar  to  those  preceding  Theorem  3.1  a tight  lower  bound, 
F^,  can  be  constructed  for  inf  P[CS I R.  ( s.n)  ] . 


Theorem  3.2.  inf 


where  F(t)  is  any  configuration  with  t components  F Ms)  and 
^ t , s 1 

(k-t)  components  F ( - 1 - d sV ) and 

t ,1  1 
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4 . Random  Size  Subset  Selection  Procedures 

4. 1 Preferred  Population  Formulation 

When  the  goal  of  an  experiment  is  to  screen  a large  number  of  populations 
to  obtain  a more  manageable  subset  containing  at  least  r preferred  popula- 
tions then  the  procedure  R^(s,n)  has  the  undesirable  characteristic  of 
always  selecting  a subset  of  size  s.  This  section  studies  the  random  size 
restricted  subset  selection  procedures  introduced  by  Gupta  and  Santner 
(1973)  and  Santner  (1975).  These  rules  are  characterized  by  the  properties: 

(1)  the  experimenter  specifies  an  upper  bound,  say  s,  on  the  size  of  the 
selected  subset  and  (2)  the  procedure  is  able  to  capitalize  on  configura- 
tions favorable  to  the  experimenter  by  selecting  fewer  than  the  maximum 
number  of  s it  . ' s. 

l 

Following  Santner  (1975),  restricted  subset  selection  procedures  will 

be  defined  m terms  of  a sequence  of  functions  T = {iJj  } , , V : E E 

n n=l  n 

satisfying: 

(4.1)  V £ e E and  Vn,  ^(C)  > S, 

1,4.2)  Vn,  is  continuous  and  strictly  increasing  and 
(4.3)  V 5 e 5,  i|i  (£)  ->-5  as  n -*  ». 

Example  4.1.  For  E = (0,°°)  take  ^(C)  = £pe^n^  where  p e(l,°°)  and 
(e(n)}  is  any  sequence  of  positive  numbers  converging  to  zero  as  n -*■  <*>. 
Example  4.2.  For  5 = R take  !|/n(£)  = £ + e(n)  where  e(n)  is  as  in 
the  previous  example. 

For  specified  a e(0,l),  and  each  s in  {l,...,k},  T satisfying 
(4.1) — (4.3)  and  n >_  N define  the  procedure 


r ma m - m mm --,- 

' ■'■  ■'  **  --•  - n—  - ■ '._ 


r 


hd 
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R (s^jn):  Take  independent  random  samples  of  size  n from  each  popula- 
tion and  select  tt  . <-^>  T.  _>  max {T^- k_ S+1  -] » T[ ^ ^ ) } where 

Tr,  , < ...  < Tr,  -i  are  the  ordered  values  of  the  sample 
[1]  — — LkJ 

a- quantiles  T^,...,T^. 

Design  Requirement:  Given  P*  e(t/k,l),  1 _<  t < k,  1 < s < k-t+1,  T 

and  0 < c!:  < 1 (0  < d"  < °°)  determine  the  smallest  n > N so  that 

— a 


(4.4) 


Pr[CS  |R0(s,y,n)]  > P"  VF  e £2*  (£2  ) 
1 z — % t 

% 


where  the  event  [CS  |R^(s , T ,n ) ] occurs  iff  at  least  one  preferred  tt^ 
is  selected. 

Remark  4.1.  In  this  formulation  it.  is  preferred  <=>  x (F.)  > 
i a i 

c"x  (Fr,  , ,)(x  (Fr,  ^ - d")  in  the  star  (tail)  ordered  model, 

a Lk-t+lj  a Lk-t+lJ 

However  only  the  goal  corresponding  to  r = 1 of  Section  3 is  discussed. 
The  monotonicity  arguments  jsed  below  in  establishing  the  infimum  of  the 
PCS  are  invalid  when  r > 1.  When  R2(k-t+l,4',n)  selects  k-t+1 
populations  a correct  selection  automatically  occurs  and  when  s = 1 the 
rule  reduces  to  R (l,n).  The  results  in  Sections  4.2  and  4.3  are  valid 
for  1 _<  t < k and  1 < s < k-t+1.  The  case  s = k-t+1  can  be 
easily  analysed  by  straightforward  arguments  similar  to  those  developed 
below. 

4 . 2 Selection  from  Star  Ordered  Families 

Given  a e(0,l)  choose  G as  in  Section  3.2  and  for  1 <_  t < k 
define  £2 *,  £2.«{(i),  £2..{  and  £2,..(i,£)  by  (3.6),  (3.9),  (3.10)  and  (3.11) 

* S 

respectively.  Clearly  Pp[CS  |R2(s,Y,n)]  = 1 V F e £2^,  Let 
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I £ {t,...,k-l}  and  for  i el  and  F p ft.,.  define  = max{T^k  l+i ) ’ ‘ - ‘ ,T(k ) ^ 


(4.5)  B.  = 

1 


{W.  _>  at  least  (k-s+l-i)  of  Tp^,...,T^ 

W.pe(n)  _>  max{T^j|l  j <_  k-i}}  , t _<  i _<  k-s 


{W.pe(n)  _>  max{T^_.  ^ jl  <_  j <_  k-i} 


, k-s  < i < k-1 


where  as  before  T^p  is  the  sample  a-quantile  with  cdf  Fp.j.  Then 

Vi  £ I and  F e ft. , Pr[CS  |R0(s,  >)',n)]  = Pr[B.].  Furthermore  the  events 
'X*  1 i 1 Z r l 

(b.)  form  a nondecreasing  sequence  and  Pp[B^]  = E^Cx..  ( ] is  nonincreasing 

in  any  of  F^-j , . . . ,F^_  . -j  and  nondecreasing  in  any  of  F^  i+i  ] ’ * ‘ ’ F[k] 

under  -<  . 

st 


Lemma  4.1.  inf  P^tCS |R0(s,V,n)]  = min  inf  P[B.]. 

0*  iel  ft.(i,l)  1 


Proof.  It  suffices  to  show  inf  P^[B.]  = inf  P,,[B.].  When  k-s+1  < i < k-1 

«v(i)  ^ 1 cyi’D  ^ 1 " “ 

and  F e then 

% 

Pj^LB . ] = PF[max{T(  . )Pe(n)/C|k-i+l  < j < k}  > max{T(j)/£|l  < j 1 k-i}] 

= Pp  [B^  ] where  F'(x)  = (F(xC) ,. . . ,F^(xC) ) is  in  ft5,.(i,l) 

as  noted  in  the  proof  of  Lemma  3.1.  The  proof  for  the  case  t < i < k-s 
is  similar. 

Now  the  events  { } and  {B^}  possess  similar  monotonicity  properties 

and  so  the  argument  previously  used  in  the  construction  of  the  lower  bound 

F*  remains  valid  and  yields 
n 
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(4.6)  inf  Pp[CS | R^(  s ,T  ,n ) ] _>  P F(t)[Bt''  E R^(t ,s,k,a,c‘!)  = R^" , say, 
ft.-.  K' 


where  F(t)  is  defined  in  Section  3.2. 

'X, 


The  lower  bound  will  now  be  evaluated.  Let  T[q]  136  the  qth-  ordered 

sample  a-quantile,  V = {k-t+1, . . . ,k} , V = {l,...,k-t},  for  q > t let 

{SH|l<v<(  )}  be  the  collection  of  all  subsets  of  size  (q-t)  from 

v 1 — — q-t 

V and  let  = V - Sq.  Then 
v v 


= TU1‘  V 


q Jstl  , Jttl  W*  - To> = Tt<U‘  T(i/  " i “‘W1  i ■ i k-t,] 

„L  ■ L Tpp>»rT'>'<T'i)n!s’uriii  i 

q=k-s+l  j=k-t+l  v=l  ^ j 


T.  ..  c T,  < T.  ..Pe(n)  Vie? 
(j)  (Z)  (:)  v 


= l (£"!)[  {B(F*  t(y|c*))}q_t{D  (F*  p ( y | c * ) ) }k-qd{B( F,  (y ll))}1 
q=k-s+l  q-t  Jo  n ,1  ,s 


.-1/  .x  e(n) 


k « ret  , . ^“1/  \ e(n) 

l (*"*)  , {B(G(G"1(y)/cJ;))}q~t{B(d(G--(^ ))  - 

q=k-s+l  q JG(c”)  c 

B(G(-~1-4y)))}  k"q  dWy)}1 


+ (B(G(-i-r) ) }q-t{D  (G(~4t)  ) )k_q{l  - Bt(a)  } 

r A nr” 


where  D (F(y))  = B(F(ype  n ' ) ) - B( F(y ) ) . As  in  the  case  of  R^Sjn) 
this  lower  bound  is  tight. 
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Theorem  4.1. 


inf  PrCCS|R  (s,¥,n)]  = 
ft*  0/ 


The  computation  of  lim  Pp  CCS J R2(s,V,n)]  for  the  sequence  {F.  } 

j-x»  o»j 

displayed  in  the  proof  of  Theorem  3.1  yields  the  result.  Furthermore 
R“  -*•  1 as  n -+  00  and  (4.4)  can  be  guaranteed  by  finite  n for  any 

A 

P*‘  < 1. 


Example  4.3.  When  ft.,,  is  the  class  of  all  continuous  IFRA  distributions 


(G(x)  = 1 - ex^n^q  a^)  then  R*(l,s,k,a,c*)  becomes 


i <::}> 


q=k-s+l 


ft{B(l-(l-x)1/c*)}q"1{B(l-(l-x)pe(n)//c*)  - B( l-(l-x )1/c" ) }k_q 


1-U-olV 


(4.7) 


dB(x)  t {B(l~(l-a)1/c  )}q_1{l  - B(a) } x 


•(l-a)pe(  ^C")  - B(l-(l-a)1/C*)}k_q 


Tables  of  the  smallest  odd  sample  sizes  required  to  satisfy  (4.4)  were 
computed  from  (4.7)  based  on  p = 4 and  e(n)  = l//n  for  a = .25,  .50,  .75, 
c*  = .65,  .70,  P*  = .75,  .90,  .95,  k = 3(1)9  and  s = 2(l)min{4 ,k-l} . 

4.3  Selection  from  Tail  Ordered  Families 

This  section  studies  the  design  problem  for  Rj(s,T,n)  when 

F e ft  and  H 1 (£)  = £ - e(n)  where  (e(n)}  is  a sequence  of  positive 
t n 

numbers  decreasing  to  zero.  The  developments  follow  from  modifications 
of  the  arguments  of  Section  4.2  along  the  lines  of  Section  3.3  and  hence 
only  the  final  results  will  be  stated. 


Sample  sizes  required  by  R2(s,4',n)  when  selecting  from 
fi.  when  G(x)  = 1 - e X with  p = 4,  e(n)  = l/Jr\  and 


43 

103  1 

55 

47 

111  li 

67 

55 

123  l: 

B3 

51 

123  li 

83 

55 

123  li 

87 

65 

139  2' 

59 

131  r 

95 

63 

135  ■ 1’ 

99 

75 

155  2: 

23 

67 

143  2i 

07 

69 

147  2: 

11 

85 

167  2: 

35 

71 

151  2: 

19 

75 

159  2: 

23 

93 

179  2 

51 

77 

159  . 2: 

27 

83 

167  2i 

35 

99 

191  263 

3 

2 

4 

3 

4 

2 

5 

4 

5 

3 

5 

2 

6 

4 

6 

3 

6 

2 

7 

4 

7 

3 

7 

2 

8 

4 

8 

3 

8 

2 

9 

4 

9 

3 

9 

2 

R2(s,Y,n)  when  selecting  from 
with  P = 4,  e(n)  = 1/Vn  and 


Table  4.2.  Sample  sizes  required  by 

_ 

fl,.  when  G(x)  = 1 - e 


23 

55 

83 

19 

49 

77 

21 

53 

79 

29 

65 

95 

21 

55 

83 

25 

59 

87 

35 

73 

105 

25 

59 

89 

29 

63 

95 

39 

81 

113 

27 

63 

93 

31 

69 

101 

4 5 

87 

121 

29 

67 

97 

35 

75 

.107 

49 

91 

127 

= .65 

n 

ii 

o 

P* 

P* 

.90 

.95 

.75 

.90 

.95 

?La 


Theorem  4.2.  inf  P[CS  I R_(s , T ,n)  ] = Pi,/j.n[B4.]  = R1  where  F(t) 
1 2 F(t)  t n ^ 

at  t t ^ 

is  as  in  Theorem  3.2  and  R = R (t ,s,k,a,d“ ) is  given  by 


I O 

q=k-s+l  q 


JG(-d*) 


{B(G[G  L[x]+d*])}q  t{B(G[G  1[x]+e(n)+d::] ) 
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B(G[G  1[x]  + d*])}k  q d{B(x) }t 


+ (B(G[d*])}q  t{B(G[e(n)+d*] ) - B(G[d*])}k  q{l-Bt(a)} 


Furthermore  R ->  1 as  n -*•  00  and  so  (4.4)  can  be  guaranteed  for  any  P*  < 1. 

Example  4.4.  If  £2  consists  of  all  configurations  with  continuous 
components  having  "lighter"  tails  than  the  logistic  distribution  then 


G(x ) = [1  + exp(-  (x  - £n[(l-a)/a]))3 


-1 


The  lower  bound  R on  the 
n 


probability  of  correctly  selecting  at  least  one  Tin  having  a-quantile 
within  d“  of  the  largest  a-quantile  (t=l)  is 


WWlmn  hi  in 


5.  Properties  of  R^(s,n)  and  R2(s,V,n) 


The  purpose  of  this  section  is  to  describe  some  small  sample  performance 


characteristics  of  R^s^)  and  R (s.'Fjn)  including  (1)  the  effect 


of  skewness  and  kurtosis  of  R (s,n),  (2)  their  monotonicity  properties, 


(3)  their  performance  as  c"  ->  1 and  some  large  cample  properties  including 


(1)  a study  of  the  number  of  populations  selected  by  R (s,¥,n)  and 


(2)  a proposal  for  choosing  the  sequence  f. 


The  first  set  of  results  show  that  the  more  skewed  or  the  heavier  the 


tails  of  cne  component  F^'s  'the  smaller  the  probability  of  correct 


selection  using  R^(s,n).  The  analysis  follows  Doksum  (196y)  who  used 


weak  orderings  to  study  the  effects  of  skewness  and  kurtosis  on  the  power 


of  monotone  rank  tests.  It  has  previously  been  established  in  Remark  3.2 


that  the  "closer  together"  the  preferred  and  nonpreferred  populations  are 


stochastically,  the  smaller  the  PCS.  In  studying  the  effects  of  skewness 


and  kurtosis  it  is  necessary  to  eliminate  this  stochastic  effect  by  restrict- 


ing attention  to  configurations  of  distributions  differing  only  in  scale 


or  location.  In  this  set  up  the  resulting  scale  or  location  parameters 


are  measures  of  "stochastic  distance."  For  1 < t < k and 


ie  I = { i[  t < i<  r+k-s}  let  V ■ = (A.  = (6  , . . . ,6 , )|°°  > 6,  > > 

1 — l'ui  lk1  1—2  — 


>6,  . = 1 > 6.  .,>...>  6.  > 0 and  c"  > 6,  . > c"6,  . ,}  and  for 

— k-i  k-i+1  — — k — k-t+1  k-i+lJ 


F c F and  A.  tV.  let  F(A  . ) = (F.  , F.  , ...,F_  ) where  F.  (x)  = F(<5.x). 
0 'ui  l 'x.'vi  °1°2  °k  3 


Remark  5.1.  If  F e F n has  F(£ ) = a then  x (F.  ) = £ /5  . . Hence 


a 6 . 
I 


,£(A^)  contains  exactly  i preferred  populations  and  F^-j  = F^ 


Theorem  5 , 1 If  F,  H t Fq  satisfy  F H and  either  H(x)  < 1 V ' x e R 


or  F(x)  < 1 V x e R then  V i e I and  A . e U 

1 1 


i 


PF(A.)CCSlRl(s’n)]  ± PH(A.)CCSlRl(S’n)] 


Proof.  Suppose  T,..^  has  cdf  B(F„  ),  1 < i < k,  and  define 
'.33  0^  — — 

(5.1)  = [B(H)]_1  B(F(T(j)))  = H_1F(T(j))  and 


(5.2)  T''  = [B(H  )]  1 B(F  (T, . , ) ) = H XF( 6 .T, . . )/6 . . 

<■33  Oj  Oj  v 3 3 1 <.33  1 


It  follows  that  Til.  has  cdf  B(H.  ) for  1 < j < k and  Tl,  ..  = 

(] ) 6^  — J — (k-i) 

T 1 ’ . . has  cdf  B(H)  since  6,  . = 1. 

(k-i)  k-i 

Since  6j  z (0,1)  for  k-i+1  j <_  k it  follows  from  Lemma  2.2  that 


(5.3) 


T’  . = H 1F(T,..)  > H_1F(6.T, ..)/6.  = TJ  • 
(l ) (l  ) - 1 (l ) 1 (l ) 


provided  T, . and  6.T,..  e R (F).  Now  has  distn  B(F)  hence 

(])  3 (3)  3 (3)  - 

6.T...  c R+(F)  a.s.  (F  ).  If  F(x)  < 1 V x < 00  then  T,  . , z R+(F)  = 

3 <33  Oj  (.1) 

(0,«>)  while  if  H(x)  < 1 V x < «>  then  ^ { R+(F)  =>  F(T^_.^)  = 

1 =>  Tl..  = H X(l)  = ».  In  either  case  (5.3)  holds  a.s.  (F.  ).  Since 

ll ; 0 j 

1 < 6.  for  1 < i < k-i  it  also  follows  from  0 < T,  . . < <5.T. ..  that 

- 3 ~ - - (3)  - 3 (3) 


(s.m  t;.,  . ■■‘m()))1r1p(u(j))/.j  = tjj, 


holds  a.s.  (Fg  ). 


Because  x4  (•)  depends  only  on  the  ranks  of  the  T,  . , 1 s and 
A . VI/ 

-1  # ' 

H F( * ) is  strictly  increasing  on  1(F)  and  nondecreasing  on  (0,“)  it 


can  be  seen  that 


RMHP I 


Z 
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(5.5) 


^.(T(l)’"',T(k))  = ^.(T(l)’‘,,,T(k))  hence 


(5.6) 


XA_(T(l),...,T(k))  2 XA.(T(i)>*-->T(k)) 


holds  a.s.  (F(  A. ) ) from  (5.3),  (5.4)  and  (5.5).  Taking  expectations 

^ 'U. 

of  both  sides  of  (5.6)  wrt  F(  A. ) and  recalling  that  TJ! v has  cdf 

% ’U  t ] ) 

B(H  ) completes  the  proof. 

6j 

Remark  5.2.  If  R+(F)  = (0,  bp)  and  R+(H)  = (0,bH)  where  max(bF,bH)  < » 
then  (5.3)  need  not  hold  for  6^  e (0,1)  and  hence  the  proof  is  not 
valid  in  this  case. 

The  results  describing  the  effects  of  kurtosis  on  FpCCG  jRp(s,n)3 

% 

will  now  be  stated.  For  F,  H e F recall  that  F H means  that  H 

has  heavier  tails  than  F.  For  0 e R and  F e F let  F^(x)  = F(x+0) 

and  for  i e I define  F.  = {0.  = ( 0^  . . . , @k)  |°°  > 0X  ^ - - ek_i  = 0 > 

0 > . . . > 0,  > -d*  >0,  , > 0,  . , - d5'}  and  for  F e F 

k-i+1  — — k 5 — k-t+1  k-i+1 

and  0.  e Fp  let  £(0.)  = (F0  ,...,F0  ).  The  configuration  F(0J  contains 
• i k 

exactly  i preferred  populations.  The  following  theorem  shows  the 
heavier  the  tails  of  the  population  distributions  the  smaller  the  probability 
of  a correct  selection. 

Theorem  5.2.  If  F,  H e F satisfy  F ■<  H and  either  F(x)  < IV  xe  R 

or  H(x)  < 1 V x e R then  V i e I and  0.  e F. 


P/[(0.  )[CS|R1(s,n)]  I PH(0.  )[CSlRl(s’n)]- 


Remark  5.3.  The  proofs  of  Theorems  5.1  and  5.2  are  not  valid  for  the 


procedure  R (s,V,n)  because  the  characteristic  functions  of  the  events 
do  not  depend  only  on  the  ranks  of  the  T^'s. 


The  monotonicity  properties  of  R (s,n)  and  R (s,t,n)  will  next 


when  referring  to  R (s,n) 


and  r=l  when  referring  to  R (s,?,!}).  Given  an  arbitrary  selection 


procedure 


denote  the  ordered  components  of  S 


Definition  5.1.  P is  monotone  wrt  ft  iff  H,  11  < i < k and  F e ft 


Recall  that  for  a given  set  of  configurations  ft,  p(*)  satisfying 


exactly  i preferred  populations 


Definition  5.2.  P is  weakly  monotone  wrt  ft  iff  V i,  t <_  i <_  k and 


whenever  S 


where 


Remark  5.4.  The  standard  definitions  of  monotonicity  used  in  parametric 


problems  differ  from  those  given  here.  In  particular  Definition  4.1  of 


monotonicity  in  Santner  (1975)  is  a special  case  (i  = r = 1)  of  the 


present  definition.  Also  note  that  monotonicity  implies  weak  monotonicity 


Lemma  5 . 1 Any  procedure  P in  {R  (s,n 





The  result  is  immediate  since  [S^|R^(s,n)]  = [R^(s,n)  selects  at  least 
r of  "(j)*  3 e and  CSi|R2(s,'i',n)]  = [R2(s,T,n)  selects  at 

least  one  of  ij.j,  j e S^]  are  nondecreasing  in  any  of  T^.j,  j e S, 
and  nonincreasing  in  any  of  Tj.j,  j £ S,, 

Some  additional  notation  will  now  be  introduced.  Let 


Qst  = {£  e n(F)lF[i]  V F[2]  %t  "st  F[kJ}’ 


V1}  = 


{F  e n(F)|xa<F[k_i])  1 P(xa(F[k-t+l]))  < Xa(F[k-i+l]); 


min{F  - fv  ) j l<?,<k-i  } > max{F_  ,(y)  j k—  i< Jt<k } V v £ 
lX'J  L ^ J 


for  t < i < k-1 


{F  t n(F)!n(xa(F[k_t+1]))  < Xa(F[i;])}  for  i=k. 


and  = U Note  that  ft  . C ft,,. 

K . . K st  R 

i=t 

Theorem  5.3.  The  classes  of  procedures  {R^CsjnJIn  _>  N^}  and 

(RjCsjTjn)  |n  >_  N^}  are  all  monotone  wrt  any  ft^  C ft  and  weakly 

monotone  wrt  any  ft  cr  S7  . 

Z K 


Proof.  To  show  monotonicity  it  suffices  to  consider  the  following  case. 


Fix  F e ft,  , r<i<k,  S.  c{l,...,k},  j e S.  and  m > i 
<v  1 — — 1 J x J 

let  SI  = U {m >— { j } . Lemma  5.1  implies 


with  m e S. 


(5.8) 


PF[S.|R1(s,n)J  = P[Si|R  (s,n)] 

(FCi] * * ’ ,F[ j ] ’ ‘ ’ ,F[m]’ ' * ,F[k] ) 

(F[l]»-*»F[m]»'*’FCm]**,*F[k]) 

= PCS^I^Cs.n)] 

( F[l] ’ ’ ’ ,F[m] ’ ' ’ ,F[m] ’ ’ ’ ,F[k] } 


< PF[S^|RL(s,n)] 


Since  R^Csjf^)  satisfies  Lemma  5.1  the  same  proof  also  shows  that  it 
is  monotone  wrt  8^.  The  proofs  that  R^(s,n)  and  R^Cs^n)  are 
weakly  monotone  wrt  are  similar. 

Remark  5.5.  In  most  parametric  selection  problems  where  the  scalar  of 

interest,  say  A^,  is  estimated  by  7\  having  cdf  F( . J A ^ ) the  following 

holds:  A^  < Aj  =>  F ( . | A ^ ) <gt  F( • | X ^ ) and  hence  the  stronger  property  of 

monotonicity  holds.  However  x (F.)  < x (F.)  =h>  F.  < ^ F.  and  hence  the 

a i a D i st  3 

property  of  weak  monotonicity  is  introduced  here.  An  example  will  be 
given  later  to  show  that  neither  RF(s,n)  nor  R^(s,H',n)  is  even 
weakly  monotone  wrt  or  Slt. 

Remark  5.6.  Since  Fj-  c F for  any  internal  I C R,  Theorem  5.3  holds  if 
ana  ft^  are  defined  as  the  appropriate  subsets  of  Fj  rather  than 


The  next  result,  obtainable  from  straightforward  computations,  describes 
the  behavior  cr  the  infimum  of  P[CS  |RF(s,n)]  over  ft,.,  as  c increases 
to  1. 


Lemma  5.2.  lim  inf  P[CS |R, (s,n)]  = B[B[a];  k-t-s+r,  k-tj  x 
c*-M.-ft, 

it 


(1  - B[B( 


a);  t-r+1,  t])  < (k-t-s+r-l)(k  ?)  1 (.*  )/(k-t-s+j-l 


s-i  . ]-r 

j =r 


>(  ,) 


■ I 

« * ■ - 


44 


^ J(» 

When  r=t  the  right  hand  side  reduces  to  ( )/(  ) which  is  the 

s-t  s 

probability  of  making  a correct  selection  by  randomly  selecting  s popula- 
tions when  exactly  t are  preferred.  So  for  any  fixed  sample  size  n _> 

there  exists  c sufficiently  close  to  1 and  an  F e Q.,.(t)  for  which 

% 

PF[CS|R1(s,n)]  < Pp[CS  | choose  s populations  at  random].  To  gain  an 
% 'v 

intuitive  feel  for  this  result  choose  an  artitrary  F e fi*(t,l).  Then 
V x e [c*,l] 


(5.9)  > ^[i]^X^  V 1 H j k-t  and  k-t+1  _<  i <_  k. 


However  (5.9)  need  not  hold  for  x^:  [ c “ , 1 ] . In  fact  given  e > 0 there 

exists  F = F(e)  e ft.,.(t,l)  such  that  for  all  x$[c“-€,  1+c] 

% % 


(5.10)  F[j]^x^  < ^[i]^x^  V 1 j k-t  and  k-t+1  < i <_ 


k. 


In  such  a case  unless  n is  sufficiently  large  to  detect  the  situation 
in  [c“’,l]  the  procedure  Rp(s,n)  will  tend  to  reflect  (5.10)  and  choose 
populations  in  (n^ p) > * • • 

The  preceding  argument  can  be  used  to  show  that  Rp(t,n)  is  not 

weakly  monotone  wrt  f2.,{.  For  fixed  n and  c"  sufficiently  close  to  1 

choose  F e fl.(t)  such  that 
% '* 


(5.11) 


Pp[CR|k1(t,n)]  = PF[kt|R1(t,n)]  < !/(*) 


where  kt  = (k-t+1, ... ,k).  Let  {^( «. ) 1 1 <_  l <_  (t)}  be  the  collection  of 


all  sets  of  size  t from  {l,...,k}.  Then 
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1 Prtj(^  ) I R,  (t  ,n)]  = 1 and  (5.11)  =">  3 a j*(£)3 
X,=l  % i 

PP[j*(4)|R.(t,n)]  > l/(*)  > P_[k  |R.(t,n)] 

l * 1 1 l ^ 1 


and  hence  R^(t,n)  is  not  weakly  monotone.  Similar  examples  exist  when 

(2*  or  R^(s,n)  is  replaced  by  (2^  or  R2(s,f,n)  respectively. 

Some  results  will  now  be  developed  for  the  number  of  populations 

selected  by  R2(s,4',n).  Let  E(i,n)  = [T^  rnaxtTp^  s+1]»  ^^(T.-^)}] 

be  the  event  that  u(i)  is  selected,  P(i,n)  = Ej,[E(i,n)]  and 
k 'i 

S(n)  = I X (T)  the  number  of  populations  selected  by  R2(s,T,n).  It 
i=l  E(i,n) 

is  straightforward  to  obtain  an  expression  for  Ep[S(n)]  for  arbitrary 
F based  on  this  representation  for  S(n). 

For  all  F e F and  as  (0,1)  it  is  well  known  that  T,  . . 4L*x  (F) 

V i / ^ 

where  T, . , has  cdf  B[F(y ) :q,n](n  > N ).  The  following  results  are 
(i),n  — a 

a direct  consequence  of  this  fact  and  the  proof  of  Theorem  5.2  and  Corollaries 
5.1  and  5.2  of  Santner  (1975). 


Theorem  5.4.  For  any  F e (2(F)  3 Xa(Ej-k-j)  > ) 


(1) 

PF(i. 

n)~>  { 

' 1 
- 0 

, i = k 
, 1 < i < 

"\  as 
k' 

n 

OO 

5 

(2) 

Z.  (n) 

r 1 

, i = k 

n as 

n 

00  and 

l 

{ 

-0 

. 1 I i < 

k / 

(3) 

S(n) 

Jii 

and 

E_[S(n)> 

1 as  n 

->  OO 

where 

, 2 

convergence  in  the  L norm. 


denotes 


The  final  topic  of  this  section  is  a proposal  for  choosing  the  constant 
P e (1,®)  when  the  sequence  £pe^n^  is  used  to  determine  the  rule 

R2(s,T,n).  The  discussion  is  limited  to  the  star  ordered  case  but  similar 
results  hold  for  the  tail  ordered  case. 
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I 


" 

.■„■■■.-■  ^ i/i—  i 


Let  [NPS|R2(s,'*’,n) ] = [R  (s,^,n)  selects  at  least  one  non  preferred 
population]  and  for  P e (l,09)  let  n(p)  be  the  smallest  n >_ 
satisfying  both 


(5.12) 


inf  P [CS|R  (s,H',n)]  j>  Pi:  and 
0*  * 


(5.13) 


sup  Pp[NPS|R2(s,4',n)]  _<  e‘‘ 


for  specified  e'e  (0,1)  and  P" e (l/k,l).  The  optimal  choice  of  p 
minimizes  n(p).  Arguments  similar  to  those  of  Section  4.2  show  the 
supremum  of  (5.13)  occurs  at  F(l,l).  Calculation  gives 


(5.14)  PF(1  1)CNPS|R2(s,y,n)]  = 1- 


(B(F*  5(y/pe(n)|c*))}k_1dB(FA  (y | 1 ) ) 


1 - 


it  l//n 
C p 


[B(G(y/c*p1/’/"))]k“1dB(G(y)) 


= 1 - [B(G(l/cftp1/’/^))]k‘1[l  - B( a) ] 


* l//n 
,c  p >1 


it  1/Al 
,c  p <1 


Remark  5.8.  For  fixed  pe(l,»)  the  righthand  side  of  (5.14)  converges 


to  zero  as  n ® and  hence  V g*  e (0,1)  3 n >N  satisfying  (5.13). 

— a 

For  each  p e (1,®)  let  n^p)  be  the  smallest  n > satisfying 
(5.12)  and  let  n2(p)  be  the  smallest  n _>  satisfying  (5.13).  Since 
both  sup  P [NPS|R  (SjTjn)]  and  inf  P [CS|r  (s,T,n)]  are  nondecreasing 

fi.,.  i 1 n...  i 1 

in  p it  follows  that  n^(p)  is  nonincreasing  in  p and  n2(P)  is 
nondecreasing  in  p.  An  optimal  choice  of  P is  P"  such  that 
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(5.1b)  max{n  (p*),n2(p“)}  = min  maxtn^p ) ,n2(p  ) } . 

l<p«» 

The  solution  of  (5.15)  can  be  obtained  via  the  same  techniques  employed 
in  the  construction  of  Tables  4.1  - 4.3  (see  Hooper  (1977)). 
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